Figure 1A Nucleotide sequence of inserted environmental DNA (029cel)

SEQ ID NO: 1



ATCAACACGC TGGAAAGTAA TTTCAAGGGT AAGGCCATCG GTTGCCGCCG 50 

GGGTAGAAAT GTGCGGTTGG ATTTCGTTGA GCGGCGTCGC CGGCGTTCCA 100 

CCGAGGGCAT AGCGCAGCAG GTTGGCGATG CCACCGGTGA GGCCTTCGGG 150 

GCCGCCTACG ATGTTGTGCT CAGCCGCCCA TGCGATGTAG CCGTCCGGCT 200 

CGGGTTCGCT CGCGGGGGTG AAGAAGACAA TGTCGTCGAG ATAAAGGTTG 250 

CCGCTTCCGC TCTCAACGCC GCCGAGGTTG AATTGGATTT CGCAAATTCT 300

CGTTAGGTCC AGCACGGAAT CGCCGACGAG GTCGGCTATG GGAATCTGAA 350 

TGCGCCCATA GGGTTGGGTA CGCGGAAGGG ACACGTAGGG ACCCACTTTG 400 

TCATTGGGCG AGACGAGCCG GACAAAGATT TGGTGCGCCG CCTGCGAGGG 450 

GCCTTGGAGG GCGAGAGAAA GGTACGTGAG GGCGCTGATG TCGTGCGTGG 500 

GACCGTCTCC CCAGTTGTCG AGATTGAGCC CAAATCCGGC CCACCATCCG 550 

GCGATAGTGT AGCTCCAATG GTAGTGACGC TCACCCTCGA AGCCGCCGCT 600 

GGAGAGTTCC TGCAAGCCGT CGCCCCAAAT GCCCGTGATG AGCGTTGCCT 650 

CGTCACGGTA GATCACAAGT TCGGCGGCGG GTGCCGGGGG AAGATCGCCT 700

TGAGTGATCA CGAGAGTGGC GGTGGCGCTG CCTTCGTGAT TAGGGTCGGT 750 

AATGGTGGCG ACGACCGTGT AGCTACCGGG CCCCACTGGC GCATGGGTGG 800 

AACCGTTGTA GGTAAAGGAG ACGTCAAGCC CCACGGGATG GGTCTCGGCA 850 

AGAGCGGCCT TGGGGGTGCC GTCGAAAACG TGTTCCAAAT TGGAGAGCGT 900 

GATGGTGGCG GGTGCCTTGA GCACAGTCAC AGAAACAGTG GATTGCACGG 950 

GATCGTGCGC TGCCGTGTCT GCAGGTGTGA AGACCACGCT GTAAAAACGG 1000 

GTTCCGGCGG ACGGTGCAAG GCCGGACAGG ACAAAGGCAA AGTCGCCGGG 1050 

GACGGCGGCT ACTCCGCCGC TCAGGCCGGC CTCCGCAAGG GTTTGCCCGA 1100 

AGGTGATGGG TGGGGCTGTG GGCCACATCT CCACAAGGCC GGTGTCCCCC 1150 

TCGTCACGCA CCGGCATGAG GGCGGAGAGG AGATGAATGT AACTGGCTTG 1200 

GTAATTGATG TCGGGCTCGG TGATTTCCCA TGAGTTCTCC GGCCAAAAAC 1250 

CATTCCAATC AAGGTAGGCT TTTTGCACGG GTTGGTCTCG GATCGCCTGA 1300 

ATGCTTCCGC TGTATTTGGG CATTGGGACC CGCCCGAAAG AAAACCAGGA 1350 

GCGGGACCGT AGAGTGAAGT GAGGGCATTG TCCCAGTCCG GCCATCGCGG 1400 

AACCAATGGT GGTAGATTTC ATTGGCTGCA CGGTCAGCGC CGCTGGCATA 1450 

CATGTTGCTA AGATAGACCA TGCCCATTGG GTTCACTCCG TGGAGATAGT 1500 

GCAGGTAGCC CATCGCGGCA TCGCGATGCG CGGCCGCGTC GGCGGGGTTG 1550 

AGCCCAAGCC TCCGTACCCC CTCGAAGAAA AAGCCAGCCT GAGACTTTGT 1600 

TTTGTTCGAG CCCCACGTGT AATCCTGATC CTTCAGGTAG GCGCGGTAGG 1650 

CGTCGGTCTG GTTATTCCAT GCACCGAGAA ACTCCCCACC GTTTATAGAA 1700 

GCCGCCATCC GGTTGCGGAT GTCGGCAGAG ACGCTAGGCG TCGCTCCCGG 1750 

GAGGGTCGTG TAGTGGGCGA GAGCTTTTTG TAGCTCACCT TGAAAGGGGA 1800 

AGAAATACCA CCACTGCACG GGCTCCATAT CGAGATAGCG CACATCGAAG 1850 

AAATCGCGAT AGACCGCACC GCCCGTGCGC TCGAAGAGCA TGGCGGCGGC 1900 

CATCACACGG TTGGCTAGCG TATCGTGGGC ATTGCGCGAG GGGCTCACGG 1950 

AAGCAAATCC GGTGTTGTCG AAAGGCACAT GAGGATGGAC CATGGTCCAA 2000 

TTCCATGCGG CGATGGCAGC GGATTCGAGG GTGACGGCAT AATCGCTCAT 2050 

GCCTACGCTC TCAAAGACAG TCGCCCCGAG GGCGAAAGCG GCGGCAGCCA 2100 

TGGCAGTGGC CTCGGTCGAG ACGGGGCCGT AGTAACGCGG ATGGGTGTCG 2150 

GTGCTCGGCG GGCTGGCGCT CTGGTGCCCC GTCACGGAAA CTTTCCCGAG 2200 

AATAGCCCCG CTCGGCTCCT GCATGCGTAA GAGCCAGTCC ATTCCCCATT 2250 

TGACTTCGTC AAGCAGGTCG GGGACACCGT TGCCGGATTC CGGGATGCCA 2300 

AAATCATCGG TAAAGACGTC AGGCCGCCCT TGATAGGCAA GGAGCAGCTC 2350 

CAGGATGACG CGCCCCGTCC ACTCGCTGTA CTTGTTGAAA TCGCCCGCAT 2400 
Figure 1B

CGAACCAACC GCCGCTGAGA TCGCGCTCCA 

CAGATGGGGC GGCTGGCGAC GTCCTGCGGG 

GTTCGCGTGG GCGTAGGGCA CCTCCTTGGC 

AGAACATGCG CACGGCCTCG CGCAGGACAA 

ATGGCGAAAC TATCGGAATG AGTGTTGTTG 

GTGGCCGGGC TCGGCAACTA CCGTAAAATC 

CCGATTGAAT ATGGATGGCG CCGCCGTTCC 

AAAACCACGA CGCCATCGTT CACGCGACGG 

GCTGTAGCTC TCGGCGCTGT TCCAGCCAAT 

CCACCTTGGT GGCATCGGCG GGGTAACCGA 

TCGGTGTGGG TGGAGGCGAC GAGGGCGGAG 

AAAGCCCGCT GTCGGCCCGA TACCAAAAAA 

TCATAGCAGG ATGTGGATAC GGAAAGGGGG 

GCCCAACGCT TGGCGAAAAC TGGATGGTTG 

TTGAGCCAAA AGCTGCGGGC AATCCTTATT 

ATCGTCGGCG GCACGACTTT TCGATGGGCG 

AGGCGCGAGG CTGCAAACCT TATGAAAAAA 
CCCGGTCAAA ATCCAGTCAA GGTTTGTTCA 
GGCACAGTCG AGCCATCAGC AGTCGCATTG 
TGTGCAAATG ACCGCTGCCG AAGGAACTGT 
CCTCGCCAAG 
Figure 2

The nucleotide sequence of 029cel ORF 



ATGAATTTTT CTCTCCCTAT TCGTTTTTTT 
CTTTTTCTTG CTGCTCATGG GCAGCTCCGC 
CCGATAAAAT CCGCATCGAC CAATTCGGTT 
GTGGCGGTGA TCGCCGACCC GCAGATTGGC 
CAGCCCCGGC GCAACGCTGG AGGTCCGTCG 
TTTTCTCCGG CTCACCGGTC CCGTGGAACG 
TCGGGAGACC GCGTGTGGTG GTTTGATTTT 
CCACTACCGC ATCCACGATC CTGCCAACAA 
CCATTGGCGC GGATGTTTAC GATGTTGTCC 
TTCTTCTATC AGCGCTCCGG GTTTGCCAAG 
GAACTGGGCC GATGCCGCTT CTCACCCGCA 
TCTGGGATAT GGGGAATGCC TCCTTGGAGC 
TTCGATGCGG GCGATTTCAA CAAGTACAGC 
CCTGGAGCTG CTCCTTGCCT ATCAAGGGCG 
ATTTTGGCAT CCCGGAATCC GGCAACGGTG 
GTCAAATGGG GAATGGACTG GCTCTTACGC 
TATTCTCGGG AAAGTTTCCG TGACGGGGCA 
GCACCGACAC CCATCCGCGT TACTACGGCC 
GCCATGGCTG CCGCCGCTTT CGCCCTCGGG 
AGGCATGAGC GATTATGCCG TCACCCTCGA 
GGAATTGGAC CATGGTCCAT CCTCATGTGC 
GCTTCCGTGA GCCCCTCGCG CAATGCCCAC 
GATGGCCGCC GCGATGCTCT TCGAGCGCAC 
ATTTCTTCGA TGTGCGCTAT CTCGATATGG 
TTCTTCCCCT TTCAAGGTGA GCTACAAAAA 
CCTCCCGGGA GCGACGCCTA GCGTCTCTGC 
CGGCTTCTAT AAACGGTGGG GAGTTTCTCG 
GACGCCTACC GCGCCTACCT GAAGGATCAG 
CAAAACAAAG TCTCAGGCTG GCTTTTTCTT 
GGCTCAACCC CGCCGACGCG GCCGCGCATC 
CTGCACTATC TCCACGGAGT GAACCCAATG 
CATGTATGCC AGCGGCGCTG ACCGTGCAGC 
GGTTCCGCGA TGGCCGGACT GGGACAATGC 
CCGCTCCTGG TTTTCTTTCG GGCGGGTCCC 
GCATTCAGGC GATCCGAGAC CAACCCGTGC 
Figure 3

The translated 029cel protein, composed of 581 amino acids



MNFSLPIRFF GIGPTAGFFL LLMGSSALVA STHTDKIRID QFGYPADATK 50 

VAVIADPQIG WNSAESYSPG ATLEVRRVND GVWFSGSPV PWNGGAIHIQ 100 

SGDRVWWFDF TWAEPGHYR IHDPANNTHS DSFAIGADVY DWLREAVRM 150 

FFYQRSGFAK EVPYAHANWA DAASHPQDVA SRPIWDMGNA SLERDLSGGW 200 

FDAGDFNKYS EWTGRVILEL LLAYQGRPDV FTDDFGIPES GNGVPDLLDE 250 

VKWGMDWLLR MQEPSGAILG KVSVTGHQSA SPPSTDTHPR YYGPVSTEAT 300 

AMAAAAFALG ATVFESVGMS DYAVTLESAA IAAWNWTMVH PHVPFDNTGF 350 

ASVSPSRNAH DTLANRVMAA AMLFERTGGA VYRDFFDVRY LDMEPVQWWY 400

FFPFQGELQK ALAHYTTLPG ATPSVSADIR NRMAASINGG EFLGAWNNQT 450 

DAYRAYLKDQ DYTWGSNKTK SQAGFFFEGV RRLGLNPADA AAHRDAAMGY 500

LHYLHGVNPM GMVYLSNMYA SGADRAANEI YHHWFRDGRT GTMPSLHSTV 550 

PLLVFFRAGP NAQIQRKHSG DPRPTRAKSL P 581 
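
The relationship between the ORF of Figure 2 and the protein of Figure 3 can be illustrated with a minimal translation sketch. It assumes the standard genetic code (NCBI translation table 1), which the figures do not state explicitly; the names `CODON_TABLE` and `translate` are hypothetical helpers introduced only for illustration. Applied to the first 30 nucleotides of the ORF as printed in Figure 2, it reproduces the first ten residues shown in Figure 3.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1),
# listed in TCAG order for the first, second and third codon positions.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string in reading frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base blocks used in the figures) is ignored.
    Codons containing characters other than A, C, G, T are rendered as 'X'.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE.get(seq[i:i + 3], "X")
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First line of the ORF as printed in Figure 2
orf_start = "ATGAATTTTT CTCTCCCTAT TCGTTTTTTT"
print(translate(orf_start))  # MNFSLPIRFF
```

The sketch only handles a single forward reading frame; checking the insert of Figures 1A and 1B against the ORF would additionally require considering the reverse complement and the other frames.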



